A Local Discrete Text Data Mining Method in High-Dimensional Data Space
نویسندگان
چکیده
Abstract Aiming at the problems of low accuracy, long time required, and large memory consumption traditional data mining methods, a local discrete text method in high-dimensional space is proposed. First all, through preparation preprocessing step, we obtain minimum divergence maximize dimension to meet demand for space; second, use information gain mine pre-processed establish an objective function highest gain; finally, functions established preparation, preprocessing, are combined form multi-objective optimization problem realize mining. The simulation experiment results show that our effectively reduces improves accuracy mining, where it also consumes less memory, indicating can solve multiple improve effect.
منابع مشابه
Mining High-Dimensional Data
With the rapid growth of computational biology and e-commerce applications, high-dimensional data becomes very common. Thus, mining highdimensional data is an urgent problem of great practical importance. However, there are some unique challenges for mining data of high dimensions, including (1) the curse of dimensionality and more crucial (2) the meaningfulness of the similarity measure in the...
متن کاملMining Text Data Mining Text Data
Clustering is a widely studied data mining problem in the text domains. The problem finds numerous applications in customer segmentation, classification, collaborative filtering, visualization, document organization, and indexing. In this chapter, we will provide a detailed survey of the problem of text clustering. We will study the key challenges of the clustering problem, as it applies to the...
متن کاملHigh-Dimensional Clustering Method for High Performance Data Mining
Many clustering methods are not suitable as high-dimensional ones because of the so-called ‘curse of dimensionality’ and the limitation of available memory. In this paper, we propose a new high-dimensional clustering method for the high performance data mining. The proposed high-dimensional clustering method provides efficient cell creation and cell insertion algorithms using a space-partitioni...
متن کاملDesigning a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms
Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...
متن کاملAnalysis of high-dimensional data using local input space histograms
The idea of local input space histograms was recently introduced as a means to augment prototype-based vector quantization methods in order to gather more information about the structure of the respective input space. Here we investigate the utility of this new idea for analysing and clustering highdimensional data. Our results demonstrate that the additional information gained about the input ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computational Intelligence Systems
سال: 2022
ISSN: ['1875-6883', '1875-6891']
DOI: https://doi.org/10.1007/s44196-022-00109-1